Estimation of speech formant-dynamics using neural networks
نویسندگان
چکیده
Formant Dynamics is an interesting research field in Speech Perception, Speech Parameterizacion, Synthesis, and Recognition. The Spectrogram shown in Fig. 1.a, corresponding to the utterance “I wished you were here a year ago” reveals important dynamic changes in the positions of the poles in the Transfer Function of the Vocal Tract with time. In the associated template shown in Fig. 1.b, the set of PARCOR parameters generating the Spectrogram may be seen. As it could be expected, the changes in the Spectrogram are related with changes in the PARCORgram. In many applications, such as Phone Labelling [Robinson.94], or Formant Tracing [Nagayama.94], it should be desirable to detect the set of Vector Parameters responsible for these changes, or to inferr Formant Dynamics from Vector Parameters. Through the present work one such application using Gradient-Adaptive Lattices and Time-Delay Neural Networks will be described.
منابع مشابه
Formant Estimation and Tracking Using Deep Learning
Formant frequency estimation and tracking are among the most fundamental problems in speech processing. In the former task the input is a stationary speech segment such as the middle part of a vowel and the goal is to estimate the formant frequencies, whereas in the latter task the input is a series of speech frames and the goal is to track the trajectory of the formant frequencies throughout t...
متن کاملDiphone Synthesis Using a Neural Network
A neural network is used to produce formant data for the Holmes parallel formant speech synthesizer [2] from an allophonic transcription of plain english text. This paper presents results obtained from training a back propagation neural network using speech generated by a conventional speech synthesizer. The network is able to learn to reproduce the basic form of formant transitions between all...
متن کاملAcoustic-to-phonetic mapping using recurrent neural networks
This paper describes the application of artificial neural networks to acoustic-to-phonetic mapping. The experiments described are typical of problems in speech recognition in which the temporal nature of the input sequence is critical. The specific task considered is that of mapping formant contours to the corresponding CVC' syllable. We performed experiments on formant data extracted from the ...
متن کاملThe Application of Neural Networks to Cognitive Phonetic Modelling
A neural network is used to generate control parameters for a parallel formant speech synthesizer, corresponding to a sequence of allophone tokens. Training is to be accomplished using formant data obtained from both natural and synthetic speech. It is intended that theories of cognitive phonetics, currently being developed in the Department of Language and Linguistics at the University of Esse...
متن کاملEstimation of Industrial Production Costs, Using Regression Analysis, Neural Networks or Hybrid Neural - Regression Method?
Estimation (Forecasting) of industrial production costs is one of the most important factor affecting decisions in the highly competitive markets. Thus, accuracy of the estimation is highly desirable. Hibrid Regression Neural Network is an approach proposed in this paper to obtain better fitness in comparison with Regression Analysis and the Neural Network methods. Comparing the estimated resul...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1995